Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 2823 |
| Missing cells | 1562 |
| Missing cells (%) | 3.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.9 MiB |
| Average record size in memory | 701.7 B |
Variable types
| Numeric | 5 |
|---|---|
| DateTime | 1 |
| Categorical | 5 |
| Text | 6 |
DATA is highly overall correlated with QTR_ID and 1 other fields | High correlation |
QUANTITYORDERED is highly overall correlated with SALES | High correlation |
PRICEEACH is highly overall correlated with SALES | High correlation |
SALES is highly overall correlated with QUANTITYORDERED and 1 other fields | High correlation |
MONTH_ID is highly overall correlated with QTR_ID | High correlation |
QTR_ID is highly overall correlated with DATA and 1 other fields | High correlation |
YEAR_ID is highly overall correlated with DATA | High correlation |
STATE is highly overall correlated with COUNTRY | High correlation |
COUNTRY is highly overall correlated with STATE | High correlation |
STATE has 1486 (52.6%) missing values | Missing |
POSTALCODE has 76 (2.7%) missing values | Missing |
Reproduction
| Analysis started | 2023-06-24 13:43:54.174995 |
|---|---|
| Analysis finished | 2023-06-24 13:44:08.386221 |
| Duration | 14.21 seconds |
| Software version | ydata-profiling vv4.3.1 |
| Download configuration | config.json |
DATA
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 307 |
|---|---|
| Distinct (%) | 10.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10258.725 |
| Minimum | 10100 |
|---|---|
| Maximum | 10425 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.2 KiB |
Quantile statistics
| Minimum | 10100 |
|---|---|
| 5-th percentile | 10115 |
| Q1 | 10180 |
| median | 10262 |
| Q3 | 10333.5 |
| 95-th percentile | 10405 |
| Maximum | 10425 |
| Range | 325 |
| Interquartile range (IQR) | 153.5 |
Descriptive statistics
| Standard deviation | 92.085478 |
|---|---|
| Coefficient of variation (CV) | 0.0089763081 |
| Kurtosis | -1.1733092 |
| Mean | 10258.725 |
| Median Absolute Deviation (MAD) | 79 |
| Skewness | 0.013822989 |
| Sum | 28960381 |
| Variance | 8479.7352 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10332 | 18 | 0.6% |
| 10106 | 18 | 0.6% |
| 10159 | 18 | 0.6% |
| 10168 | 18 | 0.6% |
| 10398 | 18 | 0.6% |
| 10222 | 18 | 0.6% |
| 10165 | 18 | 0.6% |
| 10275 | 18 | 0.6% |
| 10316 | 18 | 0.6% |
| 10386 | 18 | 0.6% |
| Other values (297) | 2643 |
| Value | Count | Frequency (%) |
| 10100 | 4 | 0.1% |
| 10101 | 4 | 0.1% |
| 10102 | 2 | 0.1% |
| 10103 | 16 | |
| 10104 | 13 | |
| 10105 | 15 | |
| 10106 | 18 | |
| 10107 | 8 | |
| 10108 | 16 | |
| 10109 | 6 | 0.2% |
| Value | Count | Frequency (%) |
| 10425 | 13 | |
| 10424 | 6 | |
| 10423 | 5 | 0.2% |
| 10422 | 2 | 0.1% |
| 10421 | 2 | 0.1% |
| 10420 | 13 | |
| 10419 | 14 | |
| 10417 | 6 | |
| 10416 | 14 | |
| 10415 | 5 | 0.2% |
QUANTITYORDERED
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 58 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.092809 |
| Minimum | 6 |
|---|---|
| Maximum | 97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.2 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 27 |
| median | 35 |
| Q3 | 43 |
| 95-th percentile | 49 |
| Maximum | 97 |
| Range | 91 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 9.7414427 |
|---|---|
| Coefficient of variation (CV) | 0.27759085 |
| Kurtosis | 0.41574379 |
| Mean | 35.092809 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.36258533 |
| Sum | 99067 |
| Variance | 94.895707 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34 | 112 | 4.0% |
| 21 | 103 | 3.6% |
| 46 | 101 | 3.6% |
| 27 | 100 | 3.5% |
| 31 | 97 | 3.4% |
| 41 | 97 | 3.4% |
| 45 | 97 | 3.4% |
| 26 | 96 | 3.4% |
| 29 | 94 | 3.3% |
| 48 | 94 | 3.3% |
| Other values (48) | 1832 |
| Value | Count | Frequency (%) |
| 6 | 2 | 0.1% |
| 10 | 2 | 0.1% |
| 11 | 2 | 0.1% |
| 12 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 15 | 4 | 0.1% |
| 16 | 1 | < 0.1% |
| 18 | 3 | 0.1% |
| 19 | 3 | 0.1% |
| 20 | 93 |
| Value | Count | Frequency (%) |
| 97 | 1 | < 0.1% |
| 85 | 1 | < 0.1% |
| 77 | 1 | < 0.1% |
| 76 | 3 | |
| 70 | 2 | 0.1% |
| 66 | 5 | |
| 65 | 1 | < 0.1% |
| 64 | 3 | |
| 62 | 1 | < 0.1% |
| 61 | 3 |
PRICEEACH
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1016 |
|---|---|
| Distinct (%) | 36.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 83.658544 |
| Minimum | 26.88 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.2 KiB |
Quantile statistics
| Minimum | 26.88 |
|---|---|
| 5-th percentile | 42.67 |
| Q1 | 68.86 |
| median | 95.7 |
| Q3 | 100 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 73.12 |
| Interquartile range (IQR) | 31.14 |
Descriptive statistics
| Standard deviation | 20.174277 |
|---|---|
| Coefficient of variation (CV) | 0.24115022 |
| Kurtosis | -0.37481769 |
| Mean | 83.658544 |
| Median Absolute Deviation (MAD) | 4.3 |
| Skewness | -0.94664886 |
| Sum | 236168.07 |
| Variance | 407.00143 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 1304 | |
| 59.87 | 6 | 0.2% |
| 96.34 | 6 | 0.2% |
| 57.73 | 5 | 0.2% |
| 80.55 | 5 | 0.2% |
| 90.17 | 5 | 0.2% |
| 67.14 | 5 | 0.2% |
| 61.99 | 5 | 0.2% |
| 89.38 | 5 | 0.2% |
| 51.93 | 5 | 0.2% |
| Other values (1006) | 1472 |
| Value | Count | Frequency (%) |
| 26.88 | 1 | < 0.1% |
| 27.22 | 1 | < 0.1% |
| 28.29 | 1 | < 0.1% |
| 28.88 | 1 | < 0.1% |
| 29.21 | 2 | |
| 29.54 | 3 | |
| 29.7 | 1 | < 0.1% |
| 29.87 | 1 | < 0.1% |
| 30.06 | 2 | |
| 30.2 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 1304 | |
| 99.91 | 1 | < 0.1% |
| 99.82 | 2 | 0.1% |
| 99.72 | 1 | < 0.1% |
| 99.69 | 1 | < 0.1% |
| 99.67 | 1 | < 0.1% |
| 99.66 | 1 | < 0.1% |
| 99.58 | 1 | < 0.1% |
| 99.57 | 1 | < 0.1% |
| 99.55 | 2 | 0.1% |
SALES
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 2763 |
|---|---|
| Distinct (%) | 97.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3553.8891 |
| Minimum | 482.13 |
|---|---|
| Maximum | 14082.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.2 KiB |
Quantile statistics
| Minimum | 482.13 |
|---|---|
| 5-th percentile | 1268.757 |
| Q1 | 2203.43 |
| median | 3184.8 |
| Q3 | 4508 |
| 95-th percentile | 7108.12 |
| Maximum | 14082.8 |
| Range | 13600.67 |
| Interquartile range (IQR) | 2304.57 |
Descriptive statistics
| Standard deviation | 1841.8651 |
|---|---|
| Coefficient of variation (CV) | 0.51826747 |
| Kurtosis | 1.7926765 |
| Mean | 3553.8891 |
| Median Absolute Deviation (MAD) | 1102.31 |
| Skewness | 1.161076 |
| Sum | 10032629 |
| Variance | 3392467.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3003 | 3 | 0.1% |
| 5464.69 | 2 | 0.1% |
| 2257.92 | 2 | 0.1% |
| 5004.8 | 2 | 0.1% |
| 2172.48 | 2 | 0.1% |
| 4948.2 | 2 | 0.1% |
| 2213.4 | 2 | 0.1% |
| 2441.04 | 2 | 0.1% |
| 3184.8 | 2 | 0.1% |
| 1463 | 2 | 0.1% |
| Other values (2753) | 2802 |
| Value | Count | Frequency (%) |
| 482.13 | 1 | |
| 541.14 | 1 | |
| 553.95 | 1 | |
| 577.6 | 1 | |
| 640.05 | 1 | |
| 651.8 | 1 | |
| 652.35 | 1 | |
| 683.8 | 1 | |
| 694.6 | 1 | |
| 703.6 | 1 |
| Value | Count | Frequency (%) |
| 14082.8 | 1 | |
| 12536.5 | 1 | |
| 12001 | 1 | |
| 11887.8 | 1 | |
| 11886.6 | 1 | |
| 11739.7 | 1 | |
| 11623.7 | 1 | |
| 11336.7 | 1 | |
| 11279.2 | 1 | |
| 10993.5 | 1 |
ORDERDATE
Date
| Distinct | 252 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.2 KiB |
| Minimum | 2003-01-06 00:00:00 |
|---|---|
| Maximum | 2005-05-31 00:00:00 |
QTR_ID
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 160.0 KiB |
| 4 | |
|---|---|
| 1 | |
| 2 | |
| 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2823 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 3 |
| 4th row | 3 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 4 | 1094 | |
| 1 | 665 | |
| 2 | 561 | |
| 3 | 503 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 4 | 1094 | |
| 1 | 665 | |
| 2 | 561 | |
| 3 | 503 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 1094 | |
| 1 | 665 | |
| 2 | 561 | |
| 3 | 503 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2823 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 1094 | |
| 1 | 665 | |
| 2 | 561 | |
| 3 | 503 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2823 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 1094 | |
| 1 | 665 | |
| 2 | 561 | |
| 3 | 503 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2823 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 1094 | |
| 1 | 665 | |
| 2 | 561 | |
| 3 | 503 |
MONTH_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.0924548 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 8 |
| Q3 | 11 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 3.6566333 |
|---|---|
| Coefficient of variation (CV) | 0.51556667 |
| Kurtosis | -1.3832748 |
| Mean | 7.0924548 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.27290156 |
| Sum | 20022 |
| Variance | 13.370967 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11 | 597 | |
| 10 | 317 | |
| 5 | 252 | |
| 1 | 229 | 8.1% |
| 2 | 224 | 7.9% |
| 3 | 212 | 7.5% |
| 8 | 191 | 6.8% |
| 12 | 180 | 6.4% |
| 4 | 178 | 6.3% |
| 9 | 171 | 6.1% |
| Other values (2) | 272 |
| Value | Count | Frequency (%) |
| 1 | 229 | |
| 2 | 224 | |
| 3 | 212 | |
| 4 | 178 | |
| 5 | 252 | |
| 6 | 131 | |
| 7 | 141 | |
| 8 | 191 | |
| 9 | 171 | |
| 10 | 317 |
| Value | Count | Frequency (%) |
| 12 | 180 | 6.4% |
| 11 | 597 | |
| 10 | 317 | |
| 9 | 171 | 6.1% |
| 8 | 191 | 6.8% |
| 7 | 141 | 5.0% |
| 6 | 131 | 4.6% |
| 5 | 252 | |
| 4 | 178 | 6.3% |
| 3 | 212 | 7.5% |
YEAR_ID
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 168.3 KiB |
| 2004 | |
|---|---|
| 2003 | |
| 2005 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 11292 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2003 |
|---|---|
| 2nd row | 2003 |
| 3rd row | 2003 |
| 4th row | 2003 |
| 5th row | 2003 |
Common Values
| Value | Count | Frequency (%) |
| 2004 | 1345 | |
| 2003 | 1000 | |
| 2005 | 478 | 16.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2004 | 1345 | |
| 2003 | 1000 | |
| 2005 | 478 | 16.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5646 | |
| 2 | 2823 | |
| 4 | 1345 | 11.9% |
| 3 | 1000 | 8.9% |
| 5 | 478 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11292 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5646 | |
| 2 | 2823 | |
| 4 | 1345 | 11.9% |
| 3 | 1000 | 8.9% |
| 5 | 478 | 4.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11292 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5646 | |
| 2 | 2823 | |
| 4 | 1345 | 11.9% |
| 3 | 1000 | 8.9% |
| 5 | 478 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11292 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5646 | |
| 2 | 2823 | |
| 4 | 1345 | 11.9% |
| 3 | 1000 | 8.9% |
| 5 | 478 | 4.2% |
PRODUCTLINE
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 187.4 KiB |
| Classic Cars | |
|---|---|
| Vintage Cars | |
| Motorcycles | |
| Planes | |
| Trucks and Buses | |
| Other values (2) |
Length
| Max length | 16 |
|---|---|
| Median length | 12 |
| Mean length | 10.914984 |
| Min length | 5 |
Characters and Unicode
| Total characters | 30813 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Motorcycles |
|---|---|
| 2nd row | Motorcycles |
| 3rd row | Motorcycles |
| 4th row | Motorcycles |
| 5th row | Motorcycles |
Common Values
| Value | Count | Frequency (%) |
| Classic Cars | 967 | |
| Vintage Cars | 607 | |
| Motorcycles | 331 | 11.7% |
| Planes | 306 | 10.8% |
| Trucks and Buses | 301 | 10.7% |
| Ships | 234 | 8.3% |
| Trains | 77 | 2.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cars | 1574 | |
| classic | 967 | |
| vintage | 607 | 12.1% |
| motorcycles | 331 | 6.6% |
| planes | 306 | 6.1% |
| trucks | 301 | 6.0% |
| and | 301 | 6.0% |
| buses | 301 | 6.0% |
| ships | 234 | 4.7% |
| trains | 77 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 5359 | |
| a | 3832 | |
| C | 2541 | 8.2% |
| r | 2283 | 7.4% |
| 2176 | 7.1% | |
| c | 1930 | 6.3% |
| i | 1885 | 6.1% |
| l | 1604 | 5.2% |
| e | 1545 | 5.0% |
| n | 1291 | 4.2% |
| Other values (15) | 6367 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23939 | |
| Uppercase Letter | 4698 | 15.2% |
| Space Separator | 2176 | 7.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 5359 | |
| a | 3832 | |
| r | 2283 | |
| c | 1930 | 8.1% |
| i | 1885 | 7.9% |
| l | 1604 | 6.7% |
| e | 1545 | 6.5% |
| n | 1291 | 5.4% |
| t | 938 | 3.9% |
| o | 662 | 2.8% |
| Other values (7) | 2610 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2541 | |
| V | 607 | 12.9% |
| T | 378 | 8.0% |
| M | 331 | 7.0% |
| P | 306 | 6.5% |
| B | 301 | 6.4% |
| S | 234 | 5.0% |
Space Separator
| Value | Count | Frequency (%) |
| 2176 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28637 | |
| Common | 2176 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 5359 | |
| a | 3832 | |
| C | 2541 | |
| r | 2283 | |
| c | 1930 | 6.7% |
| i | 1885 | 6.6% |
| l | 1604 | 5.6% |
| e | 1545 | 5.4% |
| n | 1291 | 4.5% |
| t | 938 | 3.3% |
| Other values (14) | 5429 |
Common
| Value | Count | Frequency (%) |
| 2176 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30813 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 5359 | |
| a | 3832 | |
| C | 2541 | 8.2% |
| r | 2283 | 7.4% |
| 2176 | 7.1% | |
| c | 1930 | 6.3% |
| i | 1885 | 6.1% |
| l | 1604 | 5.2% |
| e | 1545 | 5.0% |
| n | 1291 | 4.2% |
| Other values (15) | 6367 |
PHONE
Text
| Distinct | 91 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 189.3 KiB |
Length
| Max length | 17 |
|---|---|
| Median length | 10 |
| Mean length | 11.636557 |
| Min length | 9 |
Characters and Unicode
| Total characters | 32850 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 7 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2125557818 |
|---|---|
| 2nd row | 26.47.1555 |
| 3rd row | +33 1 46 62 7555 |
| 4th row | 6265557265 |
| 5th row | 6505551386 |
| Value | Count | Frequency (%) |
| 555 | 375 | 6.7% |
| 91 | 291 | 5.2% |
| 94 | 259 | 4.6% |
| 44 | 259 | 4.6% |
| 4155551450 | 180 | 3.2% |
| 8555 | 122 | 2.2% |
| 171 | 118 | 2.1% |
| 3555 | 82 | 1.5% |
| 65 | 79 | 1.4% |
| 4555 | 78 | 1.4% |
| Other values (127) | 3750 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 10957 | |
| 2770 | 8.4% | |
| 4 | 2554 | 7.8% |
| 1 | 2528 | 7.7% |
| 2 | 2161 | 6.6% |
| 9 | 1685 | 5.1% |
| 6 | 1685 | 5.1% |
| 0 | 1623 | 4.9% |
| 8 | 1476 | 4.5% |
| 3 | 1285 | 3.9% |
| Other values (6) | 4126 | 12.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 27165 | |
| Space Separator | 2770 | 8.4% |
| Dash Punctuation | 704 | 2.1% |
| Open Punctuation | 626 | 1.9% |
| Close Punctuation | 626 | 1.9% |
| Other Punctuation | 588 | 1.8% |
| Math Symbol | 371 | 1.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 10957 | |
| 4 | 2554 | 9.4% |
| 1 | 2528 | 9.3% |
| 2 | 2161 | 8.0% |
| 9 | 1685 | 6.2% |
| 6 | 1685 | 6.2% |
| 0 | 1623 | 6.0% |
| 8 | 1476 | 5.4% |
| 3 | 1285 | 4.7% |
| 7 | 1211 | 4.5% |
Space Separator
| Value | Count | Frequency (%) |
| 2770 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 704 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 626 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 626 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 588 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 371 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 32850 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 10957 | |
| 2770 | 8.4% | |
| 4 | 2554 | 7.8% |
| 1 | 2528 | 7.7% |
| 2 | 2161 | 6.6% |
| 9 | 1685 | 5.1% |
| 6 | 1685 | 5.1% |
| 0 | 1623 | 4.9% |
| 8 | 1476 | 4.5% |
| 3 | 1285 | 3.9% |
| Other values (6) | 4126 | 12.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32850 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 10957 | |
| 2770 | 8.4% | |
| 4 | 2554 | 7.8% |
| 1 | 2528 | 7.7% |
| 2 | 2161 | 6.6% |
| 9 | 1685 | 5.1% |
| 6 | 1685 | 5.1% |
| 0 | 1623 | 4.9% |
| 8 | 1476 | 4.5% |
| 3 | 1285 | 3.9% |
| Other values (6) | 4126 | 12.6% |
ADDRESSLINE1
Text
| Distinct | 92 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 212.7 KiB |
Length
| Max length | 42 |
|---|---|
| Median length | 36 |
| Mean length | 19.445979 |
| Min length | 11 |
Characters and Unicode
| Total characters | 54896 |
|---|---|
| Distinct characters | 67 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 897 Long Airport Avenue |
|---|---|
| 2nd row | 59 rue de l'Abbaye |
| 3rd row | 27 rue du Colonel Pierre Avia |
| 4th row | 78934 Hillside Dr. |
| 5th row | 7734 Strong St. |
| Value | Count | Frequency (%) |
| st | 442 | 4.6% |
| c | 306 | 3.2% |
| rue | 281 | 2.9% |
| moralzarzal | 259 | 2.7% |
| 86 | 259 | 2.7% |
| strong | 250 | 2.6% |
| street | 216 | 2.2% |
| 5677 | 180 | 1.9% |
| furth | 135 | 1.4% |
| circle | 135 | 1.4% |
| Other values (210) | 7216 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6904 | 12.6% | |
| e | 3914 | 7.1% |
| r | 3579 | 6.5% |
| a | 2883 | 5.3% |
| t | 2545 | 4.6% |
| n | 2485 | 4.5% |
| o | 2437 | 4.4% |
| l | 1979 | 3.6% |
| i | 1901 | 3.5% |
| u | 1438 | 2.6% |
| Other values (57) | 24831 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 30949 | |
| Decimal Number | 8168 | 14.9% |
| Space Separator | 6904 | 12.6% |
| Uppercase Letter | 6337 | 11.5% |
| Other Punctuation | 2226 | 4.1% |
| Dash Punctuation | 270 | 0.5% |
| Currency Symbol | 23 | < 0.1% |
| Control | 19 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3914 | |
| r | 3579 | |
| a | 2883 | |
| t | 2545 | 8.2% |
| n | 2485 | 8.0% |
| o | 2437 | 7.9% |
| l | 1979 | 6.4% |
| i | 1901 | 6.1% |
| u | 1438 | 4.6% |
| s | 1187 | 3.8% |
| Other values (15) | 6601 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1303 | |
| C | 826 | |
| M | 586 | |
| A | 464 | 7.3% |
| B | 379 | 6.0% |
| L | 357 | 5.6% |
| P | 310 | 4.9% |
| D | 305 | 4.8% |
| R | 268 | 4.2% |
| F | 253 | 4.0% |
| Other values (12) | 1286 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 1132 | |
| 6 | 1129 | |
| 2 | 983 | |
| 5 | 918 | |
| 8 | 872 | |
| 3 | 871 | |
| 4 | 790 | |
| 1 | 667 | |
| 0 | 427 | 5.2% |
| 9 | 379 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 910 | |
| , | 795 | |
| / | 349 | 15.7% |
| ' | 108 | 4.9% |
| ? | 38 | 1.7% |
| # | 26 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| 6904 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 270 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 23 |
Control
| Value | Count | Frequency (%) |
| „ | 19 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37286 | |
| Common | 17610 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3914 | 10.5% |
| r | 3579 | 9.6% |
| a | 2883 | 7.7% |
| t | 2545 | 6.8% |
| n | 2485 | 6.7% |
| o | 2437 | 6.5% |
| l | 1979 | 5.3% |
| i | 1901 | 5.1% |
| u | 1438 | 3.9% |
| S | 1303 | 3.5% |
| Other values (37) | 12822 |
Common
| Value | Count | Frequency (%) |
| 6904 | ||
| 7 | 1132 | 6.4% |
| 6 | 1129 | 6.4% |
| 2 | 983 | 5.6% |
| 5 | 918 | 5.2% |
| . | 910 | 5.2% |
| 8 | 872 | 5.0% |
| 3 | 871 | 4.9% |
| , | 795 | 4.5% |
| 4 | 790 | 4.5% |
| Other values (10) | 2306 | 13.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 54854 | |
| None | 42 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6904 | 12.6% | |
| e | 3914 | 7.1% |
| r | 3579 | 6.5% |
| a | 2883 | 5.3% |
| t | 2545 | 4.6% |
| n | 2485 | 4.5% |
| o | 2437 | 4.4% |
| l | 1979 | 3.6% |
| i | 1901 | 3.5% |
| u | 1438 | 2.6% |
| Other values (55) | 24789 |
None
| Value | Count | Frequency (%) |
| ¤ | 23 | |
| „ | 19 |
CITY
Text
| Distinct | 73 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 178.6 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 7.7530995 |
| Min length | 3 |
Characters and Unicode
| Total characters | 21887 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NYC |
|---|---|
| 2nd row | Reims |
| 3rd row | Paris |
| 4th row | Pasadena |
| 5th row | San Francisco |
| Value | Count | Frequency (%) |
| san | 307 | 9.0% |
| madrid | 304 | 8.9% |
| rafael | 180 | 5.3% |
| nyc | 152 | 4.4% |
| singapore | 79 | 2.3% |
| new | 78 | 2.3% |
| paris | 70 | 2.0% |
| francisco | 62 | 1.8% |
| bedford | 61 | 1.8% |
| nantes | 60 | 1.8% |
| Other values (72) | 2073 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2614 | 11.9% |
| e | 2008 | 9.2% |
| n | 1562 | 7.1% |
| r | 1501 | 6.9% |
| i | 1327 | 6.1% |
| o | 1298 | 5.9% |
| l | 1083 | 4.9% |
| s | 1049 | 4.8% |
| d | 1019 | 4.7% |
| 603 | 2.8% | |
| Other values (37) | 7823 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17522 | |
| Uppercase Letter | 3730 | 17.0% |
| Space Separator | 603 | 2.8% |
| Dash Punctuation | 32 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2614 | |
| e | 2008 | |
| n | 1562 | |
| r | 1501 | |
| i | 1327 | 7.6% |
| o | 1298 | 7.4% |
| l | 1083 | 6.2% |
| s | 1049 | 6.0% |
| d | 1019 | 5.8% |
| t | 523 | 3.0% |
| Other values (14) | 3538 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 553 | |
| M | 529 | |
| B | 417 | |
| N | 391 | |
| C | 296 | |
| R | 260 | 7.0% |
| L | 190 | 5.1% |
| P | 170 | 4.6% |
| Y | 152 | 4.1% |
| G | 91 | 2.4% |
| Other values (11) | 681 |
Space Separator
| Value | Count | Frequency (%) |
| 603 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 32 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21252 | |
| Common | 635 | 2.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2614 | 12.3% |
| e | 2008 | 9.4% |
| n | 1562 | 7.3% |
| r | 1501 | 7.1% |
| i | 1327 | 6.2% |
| o | 1298 | 6.1% |
| l | 1083 | 5.1% |
| s | 1049 | 4.9% |
| d | 1019 | 4.8% |
| S | 553 | 2.6% |
| Other values (35) | 7238 |
Common
| Value | Count | Frequency (%) |
| 603 | ||
| - | 32 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21887 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2614 | 11.9% |
| e | 2008 | 9.2% |
| n | 1562 | 7.1% |
| r | 1501 | 6.9% |
| i | 1327 | 6.1% |
| o | 1298 | 5.9% |
| l | 1083 | 4.9% |
| s | 1049 | 4.8% |
| d | 1019 | 4.7% |
| 603 | 2.8% | |
| Other values (37) | 7823 |
STATE
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 16 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 1486 |
| Missing (%) | 52.6% |
| Memory size | 124.8 KiB |
| CA | |
|---|---|
| MA | |
| NY | |
| NSW | |
| Victoria | |
| Other values (11) |
Length
| Max length | 13 |
|---|---|
| Median length | 2 |
| Mean length | 2.9050112 |
| Min length | 2 |
Characters and Unicode
| Total characters | 3884 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NY |
|---|---|
| 2nd row | CA |
| 3rd row | CA |
| 4th row | CA |
| 5th row | CA |
Common Values
| Value | Count | Frequency (%) |
| CA | 416 | 14.7% |
| MA | 190 | 6.7% |
| NY | 178 | 6.3% |
| NSW | 92 | 3.3% |
| Victoria | 78 | 2.8% |
| PA | 75 | 2.7% |
| CT | 61 | 2.2% |
| BC | 48 | 1.7% |
| NH | 34 | 1.2% |
| Tokyo | 32 | 1.1% |
| Other values (6) | 133 | 4.7% |
| (Missing) | 1486 |
Length
| Value | Count | Frequency (%) |
| ca | 416 | |
| ma | 190 | |
| ny | 178 | |
| nsw | 92 | 6.6% |
| victoria | 78 | 5.6% |
| pa | 75 | 5.4% |
| ct | 61 | 4.4% |
| bc | 48 | 3.5% |
| nh | 34 | 2.4% |
| tokyo | 32 | 2.3% |
| Other values (8) | 185 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 681 | |
| C | 525 | |
| N | 354 | 9.1% |
| M | 190 | 4.9% |
| i | 182 | 4.7% |
| Y | 178 | 4.6% |
| o | 168 | 4.3% |
| a | 133 | 3.4% |
| W | 118 | 3.0% |
| V | 107 | 2.8% |
| Other values (25) | 1248 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2599 | |
| Lowercase Letter | 1233 | |
| Space Separator | 52 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 182 | |
| o | 168 | |
| a | 133 | |
| t | 104 | |
| e | 100 | |
| c | 100 | |
| r | 78 | 6.3% |
| s | 61 | 4.9% |
| k | 52 | 4.2% |
| l | 41 | 3.3% |
| Other values (8) | 214 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 681 | |
| C | 525 | |
| N | 354 | |
| M | 190 | 7.3% |
| Y | 178 | 6.8% |
| W | 118 | 4.5% |
| V | 107 | 4.1% |
| T | 93 | 3.6% |
| S | 92 | 3.5% |
| P | 75 | 2.9% |
| Other values (6) | 186 | 7.2% |
Space Separator
| Value | Count | Frequency (%) |
| 52 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3832 | |
| Common | 52 | 1.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 681 | |
| C | 525 | |
| N | 354 | 9.2% |
| M | 190 | 5.0% |
| i | 182 | 4.7% |
| Y | 178 | 4.6% |
| o | 168 | 4.4% |
| a | 133 | 3.5% |
| W | 118 | 3.1% |
| V | 107 | 2.8% |
| Other values (24) | 1196 |
Common
| Value | Count | Frequency (%) |
| 52 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3884 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 681 | |
| C | 525 | |
| N | 354 | 9.1% |
| M | 190 | 4.9% |
| i | 182 | 4.7% |
| Y | 178 | 4.6% |
| o | 168 | 4.3% |
| a | 133 | 3.4% |
| W | 118 | 3.0% |
| V | 107 | 2.8% |
| Other values (25) | 1248 |
POSTALCODE
Text
MISSING 
| Distinct | 73 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 76 |
| Missing (%) | 2.7% |
| Memory size | 169.4 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 5.2133236 |
| Min length | 1 |
Characters and Unicode
| Total characters | 14321 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10022 |
|---|---|
| 2nd row | 51100 |
| 3rd row | 75508 |
| 4th row | 90003 |
| 5th row | 94217 |
| Value | Count | Frequency (%) |
| 28034 | 259 | 8.4% |
| 97562 | 205 | 6.6% |
| 10022 | 152 | 4.9% |
| 94217 | 89 | 2.9% |
| 50553 | 61 | 2.0% |
| 44000 | 60 | 1.9% |
| 3004 | 55 | 1.8% |
| n | 53 | 1.7% |
| ec2 | 51 | 1.6% |
| 5nt | 51 | 1.6% |
| Other values (75) | 2061 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3173 | |
| 2 | 1890 | |
| 1 | 1434 | |
| 4 | 1044 | 7.3% |
| 3 | 1035 | 7.2% |
| 5 | 990 | 6.9% |
| 7 | 947 | 6.6% |
| 9 | 763 | 5.3% |
| 6 | 740 | 5.2% |
| 8 | 712 | 5.0% |
| Other values (22) | 1593 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12728 | |
| Uppercase Letter | 1071 | 7.5% |
| Space Separator | 350 | 2.4% |
| Dash Punctuation | 172 | 1.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 134 | |
| T | 106 | |
| F | 104 | |
| W | 93 | 8.7% |
| M | 78 | 7.3% |
| C | 73 | 6.8% |
| P | 64 | 6.0% |
| S | 57 | 5.3% |
| X | 55 | 5.1% |
| E | 51 | 4.8% |
| Other values (10) | 256 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3173 | |
| 2 | 1890 | |
| 1 | 1434 | |
| 4 | 1044 | 8.2% |
| 3 | 1035 | 8.1% |
| 5 | 990 | 7.8% |
| 7 | 947 | 7.4% |
| 9 | 763 | 6.0% |
| 6 | 740 | 5.8% |
| 8 | 712 | 5.6% |
Space Separator
| Value | Count | Frequency (%) |
| 350 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 172 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13250 | |
| Latin | 1071 | 7.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 134 | |
| T | 106 | |
| F | 104 | |
| W | 93 | 8.7% |
| M | 78 | 7.3% |
| C | 73 | 6.8% |
| P | 64 | 6.0% |
| S | 57 | 5.3% |
| X | 55 | 5.1% |
| E | 51 | 4.8% |
| Other values (10) | 256 |
Common
| Value | Count | Frequency (%) |
| 0 | 3173 | |
| 2 | 1890 | |
| 1 | 1434 | |
| 4 | 1044 | 7.9% |
| 3 | 1035 | 7.8% |
| 5 | 990 | 7.5% |
| 7 | 947 | 7.1% |
| 9 | 763 | 5.8% |
| 6 | 740 | 5.6% |
| 8 | 712 | 5.4% |
| Other values (2) | 522 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14321 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3173 | |
| 2 | 1890 | |
| 1 | 1434 | |
| 4 | 1044 | 7.3% |
| 3 | 1035 | 7.2% |
| 5 | 990 | 6.9% |
| 7 | 947 | 6.6% |
| 9 | 763 | 5.3% |
| 6 | 740 | 5.2% |
| 8 | 712 | 5.0% |
| Other values (22) | 1593 |
COUNTRY
Categorical
HIGH CORRELATION 
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 171.2 KiB |
| USA | |
|---|---|
| Spain | |
| France | |
| Australia | |
| UK | |
| Other values (14) |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 5.0446334 |
| Min length | 2 |
Characters and Unicode
| Total characters | 14241 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USA |
|---|---|
| 2nd row | France |
| 3rd row | France |
| 4th row | USA |
| 5th row | USA |
Common Values
| Value | Count | Frequency (%) |
| USA | 1004 | |
| Spain | 342 | 12.1% |
| France | 314 | 11.1% |
| Australia | 185 | 6.6% |
| UK | 144 | 5.1% |
| Italy | 113 | 4.0% |
| Finland | 92 | 3.3% |
| Norway | 85 | 3.0% |
| Singapore | 79 | 2.8% |
| Canada | 70 | 2.5% |
| Other values (9) | 395 | 14.0% |
Length
| Value | Count | Frequency (%) |
| usa | 1004 | |
| spain | 342 | 12.1% |
| france | 314 | 11.1% |
| australia | 185 | 6.6% |
| uk | 144 | 5.1% |
| italy | 113 | 4.0% |
| finland | 92 | 3.3% |
| norway | 85 | 3.0% |
| singapore | 79 | 2.8% |
| canada | 70 | 2.5% |
| Other values (9) | 395 | 14.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1936 | |
| S | 1513 | |
| n | 1296 | 9.1% |
| A | 1244 | 8.7% |
| U | 1148 | 8.1% |
| i | 895 | 6.3% |
| r | 890 | 6.2% |
| e | 738 | 5.2% |
| p | 525 | 3.7% |
| l | 496 | 3.5% |
| Other values (23) | 3560 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9266 | |
| Uppercase Letter | 4975 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1936 | |
| n | 1296 | |
| i | 895 | |
| r | 890 | |
| e | 738 | 8.0% |
| p | 525 | 5.7% |
| l | 496 | 5.4% |
| t | 384 | 4.1% |
| c | 314 | 3.4% |
| u | 273 | 2.9% |
| Other values (10) | 1519 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1513 | |
| A | 1244 | |
| U | 1148 | |
| F | 406 | 8.2% |
| K | 144 | 2.9% |
| I | 129 | 2.6% |
| N | 85 | 1.7% |
| C | 70 | 1.4% |
| D | 63 | 1.3% |
| G | 62 | 1.2% |
| Other values (3) | 111 | 2.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14241 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1936 | |
| S | 1513 | |
| n | 1296 | 9.1% |
| A | 1244 | 8.7% |
| U | 1148 | 8.1% |
| i | 895 | 6.3% |
| r | 890 | 6.2% |
| e | 738 | 5.2% |
| p | 525 | 3.7% |
| l | 496 | 3.5% |
| Other values (23) | 3560 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14241 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1936 | |
| S | 1513 | |
| n | 1296 | 9.1% |
| A | 1244 | 8.7% |
| U | 1148 | 8.1% |
| i | 895 | 6.3% |
| r | 890 | 6.2% |
| e | 738 | 5.2% |
| p | 525 | 3.7% |
| l | 496 | 3.5% |
| Other values (23) | 3560 |
CONTACTLASTNAME
Text
| Distinct | 77 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 175.0 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 6.4413744 |
| Min length | 2 |
Characters and Unicode
| Total characters | 18184 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Yu |
|---|---|
| 2nd row | Henriot |
| 3rd row | Da Cunha |
| 4th row | Young |
| 5th row | Brown |
| Value | Count | Frequency (%) |
| freyre | 259 | 9.1% |
| nelson | 204 | 7.2% |
| young | 115 | 4.0% |
| frick | 91 | 3.2% |
| brown | 88 | 3.1% |
| yu | 80 | 2.8% |
| hernandez | 70 | 2.5% |
| ferguson | 55 | 1.9% |
| king | 54 | 1.9% |
| labrune | 53 | 1.9% |
| Other values (68) | 1774 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2297 | 12.6% |
| r | 1850 | 10.2% |
| n | 1769 | 9.7% |
| o | 1355 | 7.5% |
| a | 1137 | 6.3% |
| i | 952 | 5.2% |
| s | 759 | 4.2% |
| l | 701 | 3.9% |
| u | 647 | 3.6% |
| t | 579 | 3.2% |
| Other values (35) | 6138 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15229 | |
| Uppercase Letter | 2889 | 15.9% |
| Other Punctuation | 46 | 0.3% |
| Space Separator | 20 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2297 | |
| r | 1850 | |
| n | 1769 | |
| o | 1355 | |
| a | 1137 | 7.5% |
| i | 952 | 6.3% |
| s | 759 | 5.0% |
| l | 701 | 4.6% |
| u | 647 | 4.2% |
| t | 579 | 3.8% |
| Other values (15) | 3183 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 458 | |
| H | 280 | |
| N | 247 | 8.5% |
| B | 229 | 7.9% |
| Y | 221 | 7.6% |
| K | 192 | 6.6% |
| S | 165 | 5.7% |
| C | 162 | 5.6% |
| L | 161 | 5.6% |
| T | 147 | 5.1% |
| Other values (8) | 627 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 46 |
Space Separator
| Value | Count | Frequency (%) |
| 20 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18118 | |
| Common | 66 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2297 | 12.7% |
| r | 1850 | 10.2% |
| n | 1769 | 9.8% |
| o | 1355 | 7.5% |
| a | 1137 | 6.3% |
| i | 952 | 5.3% |
| s | 759 | 4.2% |
| l | 701 | 3.9% |
| u | 647 | 3.6% |
| t | 579 | 3.2% |
| Other values (33) | 6072 |
Common
| Value | Count | Frequency (%) |
| ' | 46 | |
| 20 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18184 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2297 | 12.6% |
| r | 1850 | 10.2% |
| n | 1769 | 9.7% |
| o | 1355 | 7.5% |
| a | 1137 | 6.3% |
| i | 952 | 5.2% |
| s | 759 | 4.2% |
| l | 701 | 3.9% |
| u | 647 | 3.6% |
| t | 579 | 3.2% |
| Other values (35) | 6138 |
CONTACTFIRSTNAME
Text
| Distinct | 72 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 173.9 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 5.6680836 |
| Min length | 3 |
Characters and Unicode
| Total characters | 16001 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Kwai |
|---|---|
| 2nd row | Paul |
| 3rd row | Daniel |
| 4th row | Julie |
| 5th row | Julie |
| Value | Count | Frequency (%) |
| diego | 259 | 9.0% |
| valarie | 257 | 8.9% |
| julie | 117 | 4.1% |
| sue | 84 | 2.9% |
| michael | 84 | 2.9% |
| juri | 60 | 2.1% |
| maria | 58 | 2.0% |
| peter | 55 | 1.9% |
| elizabeth | 55 | 1.9% |
| janine | 53 | 1.8% |
| Other values (64) | 1791 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2067 | |
| i | 1890 | |
| a | 1875 | 11.7% |
| r | 1069 | 6.7% |
| n | 1049 | 6.6% |
| l | 1017 | 6.4% |
| o | 846 | 5.3% |
| t | 573 | 3.6% |
| u | 505 | 3.2% |
| J | 420 | 2.6% |
| Other values (33) | 4690 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13046 | |
| Uppercase Letter | 2873 | 18.0% |
| Space Separator | 50 | 0.3% |
| Other Punctuation | 32 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2067 | |
| i | 1890 | |
| a | 1875 | |
| r | 1069 | |
| n | 1049 | |
| l | 1017 | |
| o | 846 | |
| t | 573 | 4.4% |
| u | 505 | 3.9% |
| g | 391 | 3.0% |
| Other values (13) | 1764 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 420 | |
| M | 389 | |
| V | 363 | |
| D | 359 | |
| A | 220 | |
| P | 204 | |
| S | 146 | 5.1% |
| K | 131 | 4.6% |
| E | 121 | 4.2% |
| W | 92 | 3.2% |
| Other values (8) | 428 |
Space Separator
| Value | Count | Frequency (%) |
| 50 |
Other Punctuation
| Value | Count | Frequency (%) |
| ¡ | 32 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15919 | |
| Common | 82 | 0.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2067 | |
| i | 1890 | |
| a | 1875 | |
| r | 1069 | 6.7% |
| n | 1049 | 6.6% |
| l | 1017 | 6.4% |
| o | 846 | 5.3% |
| t | 573 | 3.6% |
| u | 505 | 3.2% |
| J | 420 | 2.6% |
| Other values (31) | 4608 |
Common
| Value | Count | Frequency (%) |
| 50 | ||
| ¡ | 32 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15969 | |
| None | 32 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2067 | |
| i | 1890 | |
| a | 1875 | |
| r | 1069 | 6.7% |
| n | 1049 | 6.6% |
| l | 1017 | 6.4% |
| o | 846 | 5.3% |
| t | 573 | 3.6% |
| u | 505 | 3.2% |
| J | 420 | 2.6% |
| Other values (32) | 4658 |
None
| Value | Count | Frequency (%) |
| ¡ | 32 |
| DATA | QUANTITYORDERED | PRICEEACH | SALES | MONTH_ID | QTR_ID | YEAR_ID | PRODUCTLINE | STATE | COUNTRY | |
|---|---|---|---|---|---|---|---|---|---|---|
| DATA | 1.000 | 0.043 | -0.004 | 0.021 | -0.012 | 0.833 | 0.954 | 0.000 | 0.336 | 0.257 |
| QUANTITYORDERED | 0.043 | 1.000 | 0.006 | 0.538 | -0.026 | 0.139 | 0.193 | 0.000 | 0.108 | 0.041 |
| PRICEEACH | -0.004 | 0.006 | 1.000 | 0.788 | 0.011 | 0.023 | 0.016 | 0.146 | 0.000 | 0.018 |
| SALES | 0.021 | 0.538 | 0.788 | 1.000 | -0.002 | 0.021 | 0.056 | 0.112 | 0.000 | 0.000 |
| MONTH_ID | -0.012 | -0.026 | 0.011 | -0.002 | 1.000 | 0.999 | 0.414 | 0.038 | 0.308 | 0.253 |
| QTR_ID | 0.833 | 0.139 | 0.023 | 0.021 | 0.999 | 1.000 | 0.380 | 0.020 | 0.356 | 0.236 |
| YEAR_ID | 0.954 | 0.193 | 0.016 | 0.056 | 0.414 | 0.380 | 1.000 | 0.005 | 0.314 | 0.224 |
| PRODUCTLINE | 0.000 | 0.000 | 0.146 | 0.112 | 0.038 | 0.020 | 0.005 | 1.000 | 0.180 | 0.157 |
| STATE | 0.336 | 0.108 | 0.000 | 0.000 | 0.308 | 0.356 | 0.314 | 0.180 | 1.000 | 0.996 |
| COUNTRY | 0.257 | 0.041 | 0.018 | 0.000 | 0.253 | 0.236 | 0.224 | 0.157 | 0.996 | 1.000 |
| DATA | QUANTITYORDERED | PRICEEACH | SALES | ORDERDATE | QTR_ID | MONTH_ID | YEAR_ID | PRODUCTLINE | PHONE | ADDRESSLINE1 | CITY | STATE | POSTALCODE | COUNTRY | CONTACTLASTNAME | CONTACTFIRSTNAME | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 10107 | 30 | 95.70 | 2871.00 | 2/24/2003 0:00 | 1 | 2 | 2003 | Motorcycles | 2125557818 | 897 Long Airport Avenue | NYC | NY | 10022 | USA | Yu | Kwai |
| 1 | 10121 | 34 | 81.35 | 2765.90 | 05-07-2003 00:00 | 2 | 5 | 2003 | Motorcycles | 26.47.1555 | 59 rue de l'Abbaye | Reims | NaN | 51100 | France | Henriot | Paul |
| 2 | 10134 | 41 | 94.74 | 3884.34 | 07-01-2003 00:00 | 3 | 7 | 2003 | Motorcycles | +33 1 46 62 7555 | 27 rue du Colonel Pierre Avia | Paris | NaN | 75508 | France | Da Cunha | Daniel |
| 3 | 10145 | 45 | 83.26 | 3746.70 | 8/25/2003 0:00 | 3 | 8 | 2003 | Motorcycles | 6265557265 | 78934 Hillside Dr. | Pasadena | CA | 90003 | USA | Young | Julie |
| 4 | 10159 | 49 | 100.00 | 5205.27 | 10-10-2003 00:00 | 4 | 10 | 2003 | Motorcycles | 6505551386 | 7734 Strong St. | San Francisco | CA | NaN | USA | Brown | Julie |
| 5 | 10168 | 36 | 96.66 | 3479.76 | 10/28/2003 0:00 | 4 | 10 | 2003 | Motorcycles | 6505556809 | 9408 Furth Circle | Burlingame | CA | 94217 | USA | Hirano | Juri |
| 6 | 10180 | 29 | 86.13 | 2497.77 | 11-11-2003 00:00 | 4 | 11 | 2003 | Motorcycles | 20.16.1555 | 184, chausse de Tournai | Lille | NaN | 59000 | France | Rance | Martine |
| 7 | 10188 | 48 | 100.00 | 5512.32 | 11/18/2003 0:00 | 4 | 11 | 2003 | Motorcycles | +47 2267 3215 | Drammen 121, PR 744 Sentrum | Bergen | NaN | N 5804 | Norway | Oeztan | Veysel |
| 8 | 10201 | 22 | 98.57 | 2168.54 | 12-01-2003 00:00 | 4 | 12 | 2003 | Motorcycles | 6505555787 | 5557 North Pendale Street | San Francisco | CA | NaN | USA | Murphy | Julie |
| 9 | 10211 | 41 | 100.00 | 4708.44 | 1/15/2004 0:00 | 1 | 1 | 2004 | Motorcycles | (1) 47.55.6555 | 25, rue Lauriston | Paris | NaN | 75016 | France | Perrier | Dominique |
| DATA | QUANTITYORDERED | PRICEEACH | SALES | ORDERDATE | QTR_ID | MONTH_ID | YEAR_ID | PRODUCTLINE | PHONE | ADDRESSLINE1 | CITY | STATE | POSTALCODE | COUNTRY | CONTACTLASTNAME | CONTACTFIRSTNAME | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2813 | 10293 | 32 | 60.06 | 1921.92 | 09-09-2004 00:00 | 3 | 9 | 2004 | Ships | 011-4988555 | Via Monte Bianco 34 | Torino | NaN | 10100 | Italy | Accorti | Paolo |
| 2814 | 10306 | 35 | 59.51 | 2082.85 | 10/14/2004 0:00 | 4 | 10 | 2004 | Ships | (171) 555-1555 | Fauntleroy Circus | Manchester | NaN | EC2 5NT | UK | Ashworth | Victoria |
| 2815 | 10315 | 40 | 55.69 | 2227.60 | 10/29/2004 0:00 | 4 | 10 | 2004 | Ships | 40.67.8555 | 67, rue des Cinquante Otages | Nantes | NaN | 44000 | France | Labrune | Janine |
| 2816 | 10327 | 37 | 86.74 | 3209.38 | 11-10-2004 00:00 | 4 | 11 | 2004 | Ships | 31 12 3555 | Vinb'ltet 34 | Kobenhavn | NaN | 1734 | Denmark | Petersen | Jytte |
| 2817 | 10337 | 42 | 97.16 | 4080.72 | 11/21/2004 0:00 | 4 | 11 | 2004 | Ships | 2125558493 | 5905 Pompton St. | NYC | NY | 10022 | USA | Hernandez | Maria |
| 2818 | 10350 | 20 | 100.00 | 2244.40 | 12-02-2004 00:00 | 4 | 12 | 2004 | Ships | (91) 555 94 44 | C/ Moralzarzal, 86 | Madrid | NaN | 28034 | Spain | Freyre | Diego |
| 2819 | 10373 | 29 | 100.00 | 3978.51 | 1/31/2005 0:00 | 1 | 1 | 2005 | Ships | 981-443655 | Torikatu 38 | Oulu | NaN | 90110 | Finland | Koskitalo | Pirkko |
| 2820 | 10386 | 43 | 100.00 | 5417.57 | 03-01-2005 00:00 | 1 | 3 | 2005 | Ships | (91) 555 94 44 | C/ Moralzarzal, 86 | Madrid | NaN | 28034 | Spain | Freyre | Diego |
| 2821 | 10397 | 34 | 62.24 | 2116.16 | 3/28/2005 0:00 | 1 | 3 | 2005 | Ships | 61.77.6555 | 1 rue Alsace-Lorraine | Toulouse | NaN | 31000 | France | Roulet | Annette |
| 2822 | 10414 | 47 | 65.52 | 3079.44 | 05-06-2005 00:00 | 2 | 5 | 2005 | Ships | 6175559555 | 8616 Spinnaker Dr. | Boston | MA | 51003 | USA | Yoshido | Juri |